Hans Peter Luhn and Herbert M. Ohlman: Their roles in the origins of keyword-in-context/permutation automatic indexing

نویسنده

  • Robert V. Williams
چکیده

InNovember 1958, at the International Conference on Scientific Information (ICSI) in Washington, DC, Hans Peter Luhn, of the International Business Machines (IBM) Company, and Herbert M. Ohlman, of the System Development Corporation (SDC), simultaneously and independently presented their new systems for automatic indexing. Luhn called his system “Keyword-in-Context (KWIC) indexing” and Ohlman called his “permutation indexing.” Within a few years, despite the similarities and obvious lack of advantages of one system over the other, Luhn’s KWIC became the de facto standard for title derivative automatic indexing, and Ohlman’s permutation indexing system largely disappeared. A 1965 state-of-the art report on automatic indexing (Stevens, 1965), with an extensive discussion of KWIC and related concepts, did not discuss Ohlman’s work, though it is cited in the bibliography. A 1966 retrospective review of the KWIC concept gave a slight mention of Ohlman’s work on permuted indexes, but uses the phrase “. . . when Luhn invented the KWIC index. . . .” (Fischer, 1966, p. 58). By 1995, Wellisch’s (1996) handbook on indexing had no mention of Ohlman and stated “Luhn, who first published the successful application of his idea in 1958, called his method KWIC . . . and it became, indeed, the first and to this day the only fully automatic indexing method” (pp. 258–259).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

متن کامل

A Study on the Origins of Identification among the High-School Female Students in Khalkhal

This research studies the origins of identity among the female students in Khalkhal high-schools in 2012-2013. Data was gathered through the survey method (questionnaires) and the library method. The dependent variable was the study of identity seeking, but the independent variable includes the degree of religious orientation, social status of the family, the access to foreign networks, the sol...

متن کامل

Advantages of thesaurus representation using the Simple Knowledge Organization System (SKOS) compared with proposed alternatives

The concept of thesaurus has evolved from a list of conceptually interrelated words to today's controlled vocabularies, where terms form complex structures through semantic relationships. This term comes from the Latin and has turn been derived from the Greek "θησαυρός", which means treasury according to the Spanish Royal Academy, in whose dictionary it is also defined as: 'name given by its au...

متن کامل

مدل دو مرحله ای شکاف- گلچین برای نمایه سازی خودکار متون فارسی

Purpose: Each language has its own problems. This leads to consider appropriate models for automatic indexing of every language. These models should concern the exhaustificity and specificity of indexing.   This paper aims at introduction and evaluation of a model which is suited for Persian automatic indexing. This model suggests to break the text into the particles of candidate terms and to c...

متن کامل

بررسی نقش انواع بافتار هم‌نویسه‌ها در تعیین شباهت بین مدارک

Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation and thereby improving the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts could be divided to different types, wit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JASIST

دوره 61  شماره 

صفحات  -

تاریخ انتشار 2010